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Abstract — This paper proposes a dynamic primal-dual type 
algorithm to solve the optimal scheduling problem in wireless 
networks subject to uncertain parameters, which are generated 
by stochastic network processes such as random packet arrivals, 
channel fading, and node mobilities. The algorithm is a gen- 
eralization of the well-known max-weight scheduling algorithm 
proposed by Tassiulas et al, where only queue length information 
is used for computing the schedules when the arrival rates 
are uncertain. Using the technique of fluid limits, sample path 
convergence of the algorithm to an arbitrarily close to optimal 
solution is proved, under the assumption that the Strong Law 
of Large Numbers (SLLN) applies to the random processes 
which generate the uncertain parameters. The performance of the 
algorithm is further verified by simulation results. The method 
may potentially be applied to other applications where dynamic 
algorithms for convex problems with uncertain parameters are 
needed. 

I. Introduction 

Scheduling in wireless networks involves efficiently allo- 
cating network resources among competing network users 
in the presence of uncertainties. These uncertainties may be 
either due to unexpected events, such as link failures, or due 
to intricate cross-layer interactions in wireless networks. For 
example, the packet arrival rates may be unknown (e.g., |1|, 
llll), which depend on upper layer dynamics such as routing 
and congestion control protocols. As another example, the 
wireless channel statistics may be also unknown (e.g., ||2|), 
since they depend on complex network events such as channel 
fading, power control and node mobilities. 

In the presence of such uncertain parameters, it may no 
longer be optimal to use the static allocation approach (e.g., 
131), which produces periodic schedules by solving a static 
underlying convex optimization problem (which usually has 
exponential size) with estimated uncertain parameters. In par- 
ticular, if these uncertain parameters are slowly converging, 
or time varying, the estimated parameters may fail to track 
the changes in their true value, which often leads to subop- 
timal schedules. Further, it may be impractical to estimate 
the uncertain parameters for large wireless networks, as the 
number of the parameters may grow fast (e.g., exponentially) 
with the size of the network. For a simple illustration, consider 
a wireless network with n links, such that each link randomly 
switches on or off after certain random number of time slots. 
In such case, a complete specification of the network topology 



probabilities may require as large as 2" — 1 parameters, which 
quickly becomes impossible to estimate as n grows. 

On the other hand, online algorithms (such as fl], (21, 
El, III) are more robust to the changes to the uncertain 
parameters (such as arrival rates), since they use queue length 
(which can also be interpreted as prices f9l, fW\) information 
for scheduling in each time slot. For example, it has been 
shown that lH such algorithms can achieve network stability 
even if the "instantaneous rates" of the traffic vary arbitrarily 
inside the network capacity region. Further, compared to the 
estimation based approach, these online algorithms may be 
more scalable to the network size, in the sense that the 
dimension of the queue length vector corresponds to the 
number of constraints (such as rate constraint for each link), 
which usually grows slowly, whereas the number of uncertain 
parameters can grow very fast (e.g., exponentially). 

In this paper we solve a general class of optimal wire- 
less network scheduling problems with uncertain parameters, 
whose underlying static problem is described by the convex 
optimization problem OPT in Section [III Essentially, we 
require that the structures of the convex objective functions and 
convex constraint functions are known, except the values of 
the uncertain parameters. These parameters will be generated 
by certain stochastic processes and observed by the network 
gradually over time slots. We propose a greedy primal-dual 
dynamic algorithm (Algorithm [T] in Section Ullb to achieve the 
optimal scheduling asymptotically. Using the novel technique 
of fluid limits [12|, optimality can be guaranteed under the 
assumption that all the network processes generating the 
uncertain parameters satisfy SLLN (see details in Section HH). 
Note that this assumption is quite mild, since we can guarantee 
optimality as long as these processes converge, no matter 
how slowly the convergence happen. Thus, intuitively, our 
algorithm can automatically track the convergence of these 
processes and correct the mistakes which are made within any 
finite time history. 

Our algorithm is a generalization of the well-known max- 
weight algorithm HI, which was shown to be throughput 
optimal for i.i.d arrival processes. Our algorithm is related to, 
but different from the utility-optimal scheduling algorithm by 
Neely ||2|, which achieves the optimal scheduling by cleverly 
transforming the problem into optimizing the time-average of 



the utility (and constraint) functions, to which a dual-type 
algorithm applies. Our algorithm is also different from the 
primal-dual algorithm by Neely |5| since we use a scaled 
queue length (by 1/t) in the scheduling, which corresponds to 
the approximated gradient. Stolyar |6| also proposed a primal- 
dual type scheduling algorithm, and proved its optimality using 
a fluid limit obtained from a different scaling. Since the fluid 
scaling in |6| is taken over different systems, it is hard to relate 
the optimality in the fluid limits to the one in the original 
system. Finally, our algorithm can be used as a MAC layer 
solution for the general framework of cross-layer optimization 
problem (e.g., |4j, [7J, [8J, [lOj) for wireless networks. 

The organization of the following sections is as follows: In 
Section In] we describe the queueing network model as well as 
the optimization problem OPT, and in Section we describe 
the scheduling algorithm. Section HV] proves the optimality of 
the algorithm. Section [V] illustrate the algorithm performance 
in simulation, and finally Section IVTl concludes this paper 

n. Network Model and Problem Formulation 

In this section we describe the queueing network and 
propose the optimization problem OPT. We first introduce 
the queueing network model. 

A. Queueing Network 

We consider the scheduling problem at the medium access 
(MAC) layer of a multi-hop wireless network, where the 
network is modeled as a set of n links. We assume a time- 
slotted network model, and in each time slot t, the network 
is in one of the following states: M = {1,2, . . . , M}. The 
network state can be used to model network topology, channel 
fading, and user mobility, etc. We further assume that these 
network states can be measured by the user nodefl which 
are assumed to be equipped with channel monitoring devices. 
We associate each network state to e with a finite set 
of resource allocation modes S*-™-* = {^J™"*, C2™^ • ■ •}' where 



each mode ^[.'"■' e ^l™) corresponds to a configuration of 
network resource allocation, such as carrier and frequency 
selection in OFDM systems, spreading codes choice in CDMA 
systems and time slots assignment in TDMA systems. 

Denote y as the uncertain parameters, which are generated 
by the stochastic process Y{t), which is a cumulative vector 
process whose time average converges to y. Specifically, the 
assumptions on Y{t) are: 1) it is subject to SLLN, i.e., with 
probability 1 (w.p.l). 



lim Y{t)/t = y 

t^OO 



(1) 



and 2) it has uniformly bounded increment in each time slot: 

\\Y{t)-Y{t-l)\\<Y„,,^,yt>0 (2) 

' Note that although the network states are global, they often allow (approxi- 
mate) decompositions (e.g., [2J) according to either the geographic structure or 
channel orthogonality, in which case one only needs to measure local network 
states. 



where Y,nax > is a finite constant. For specific examples, 
consider the cumulative network state process Mm{t)'- 

t 

Sm{t) ='Y^'^{m{T)=m} (3) 

T=l 

where ■m{t) is the network state at time slot t, and is 
the indicator function, i.e., l{true} = 1 and l{faise} = 0. Thus, 
SLLN implies that w.p.l, 

1 1 * 

lim -S.nit) = lim - V t{m(r)=m} = 7r("), Vto e 7W 

As another example, consider the external packet arrival pro- 
cess A{t), which is a n X 1 vector representing the cumulative 
external packet arrivals during the first t time slots. Similarly, 
SLLN implies that w.p.l. 



lim A{t)/t = a, w.p.l 



(4) 



Further, we require that the maximum packet arrivals in any 
time slot are uniformly bounded: 



\A{t) - A{t - < A„,ax, Vt>0 



(5) 



where Amax > is a finite constant. 

We finally describe the queueing system model. The queu- 
ing dynamics of the network is modeled as follows: 



Q(t) = g(0) + A{t) ~ R{m{t))D{t) 



(6) 



where Q{t) is the queue length vector at time slot t, and 
R{m{t)) is the nxn routing matrix, such that Ru{m{t)) — 1, 
and Rij{m{t)) = — 1 only if link i serves as the next hop for 
link j at time slot t, as specified by certain routing protocols, 
otherwise Rij{m{t)) = 0. Note that the routing matrix 
R{m{t)) is a function of the network state, and therefore 
SLLN implies 



1 * 

lim -yi?(TO(T))= y 7r(")i?(TO) 



(7) 



is a n X 1 vector representing the cumulative packet 
departures during the first t time slots, which are determined 
by the resource allocation modes as specified by the scheduler 
in each time slot. Specifically, at each time slot t with 
network state m, if the scheduler chooses a resource allocation 
mode there is an associated departure vector G')^\ 

whose each entry G^™^ corresponds to the number of packets 
transmitted successfully by link i. Note that the choice of 
resource allocation mode ^^'"^ is subject to the constraint that 
Q{t) - G^"^ > 0, so that the queue lengths never become 
negative. Note that this constraint can be easily satisfied in 
general systems. For example, if the allocation mode ^^™'' 
corresponds to independent sets of the interference graph 
(see, for example, [7|), one can simply transmit the subset 
of links with nonempty queues, which are still independent. 
In a compact form, we can express the departure process as 



D{t) = ^ G(")T(")(t), 



(8) 



where G'-™^ is a matrix whose columns are 
is a vector whose each entry Tjf^^t) corresponds to the 
number of time slots that resource allocation mode is 
chosen during the first t time slots. 

A basic requirement on the scheduler is that it should 
achieve rate stability fT2^, i.e., 

lim D{t)/t ^ a (9) 

SO that the departure rate of each link is equal to the arrival 
rate, as required by the underlying static optimization problem 
OPT, which we formulate in the next subsection. 

B. Optimization Problem 

In this section we introduce the optimization problem OPT, 
which is implicitly solved by the optimal schedulers. The 
problem OPT is as follows OPT: 

OPT: min f{x;y) (10) 

X 

s.t. h{x;y)^0 (11) 

xirri) ^ 0, l^a;(™' = 1, Vm £ X (12) 

In the above formulation, a;^™) as a resource allocation vector 
when the network state is m. That is, each entry a;^™^ is the 
asymptotic time fraction (assuming the limit exists for now) 
that resource allocation mode ^[™^ is chosen, during the time 
slots where the network state is m. Thus, cc^") is subject to 
the simplex constraint ( fT2] i. x = {x^^\x'^^\ . . . jx'-^^^) is a 
big vector representing the total resource allocation vector as 
specified by the scheduler f{x; y) is a general convex cost 
function of variable x, and h{x; y) is a vector of general 
convex constraint functions of variable x. The additional 
parameter y represents the uncertain parameters, which is 
valid under the assumption that the corresponding processes 
are subject to SLLN. Finally, we assume that both f{x;y) 
and h{x; y) are continuously differentiable as functions of 
variables [x, y). 

The formulation of OPT is quite general, which can be used 
to model various applications in the literature. For example, 
if we want to minimize the total transmission power, we can 
choose y = {tt,p), and choose the cost function as follows 

f{x;y)= 7r(")p(™)'^a;(™) (13) 

where the time fraction that the network state is to, and 

is a power vector where each entry corresponds to 
the power consumption when resource allocation mode 
is chosen at network state to. Thus, the cost function in il3[ 
can be interpreted as the average power consumption by the 
scheduler. Note that we can also encode the power constraint 
into h{x,y) by choosing y = {n,p) and then choosing 

h{x-y)^ ^ Tr^^^pf^^a;^") -p (14) 

where P^™) is a diagonal matrix where each diagonal entry 
P^™' corresponds to the power consumption when resource 



allocation mode is chosen when the network state is 

TO, and p is the power constraint vector. In this case, (fl4] i 
is equivalent to requiring a constraint of p on the average 
transmission power. In order to encode the network stability 
constraint, we can choose y = (tt, a) and then choose 

h{x-y) = a- 7r(")p(m)G(™)a;(") (15) 

Thus, (flSl l requires that the average external and internal 
arrivals should be less than the average departures, in which 
case the network is rate stable. 

III. Scheduling Algorithm 

In this section we will describe the algorithm to solve 
OPT. As a standard approach in solving constrained convex 
optimization problems fTT), we transform OPT into another 
static "penalized" problem, PEN, to which our scheduling 
algorithm can directly apply. Based on this, we then introduce 
the scheduling algorithm which solves PEN and, therefore, 
also solves OPT. 

A. Transformed Problem 

Assuming that OPT is strictly feasible, we first change the 
constraints in (fTTl i as follows 

h{x; y) + z = Q, el ^ z ^ z^axl (16) 

where e > is a small scalar, and z„iax > is a sufficiently 
large constant such that the inequality and equality constraints 
are equivalent. Denote /* as the optimal cost when the 
constraint is changed to (fTSI l. Thus, the optimal value of OPT 
is /q . We have the following sensitivity lemma stating that /* 
is a good approximation of /q with sufficiently small e. 

Lemma 1 ([11 J): Denote Ao and as two Lagrangian 
multipliers for /q and /*, respectively. We have 

l/o <eniax(||A,||i,||Ao||i) (17) 

We next define the transformed problem as follows: 

PEN: min g{x, z; y) ^ f{x; y) + Pp{x, z\ y) 

X.Z 

s.t. el<Z< Zmaxl 

a.("0 >_ 0, l^a;("') = 1, Vm £ X 

where /? is a large constant to control approximation accu- 
racy, and p{x,z;y) corresponds to the penalty term, which 
corresponds to various standard penalty functions [llj, e.g., 

p{x,z;y)^-\\h{x;y) + z\r (18) 
a 

for a > 1. In particular, the standard Lyapunov drift analysis 
(e.g., fT\, f2\, f4l, |7|) con-esponds to the case a = 2. 

Denote {Xp, Zp) as a solution of PEN. We have the follow- 
ing result holds: 

Lemma 2: f{x*;y) < /*. 



Proof: Denote {Xq^Zq) as a solution of OPT with con- 
straint in ( fTTT l replaced by (fTST l. We have 

f{xl-y) < f{x;;y)+(3p{x;,z;;y) 
< f{x*;y)+l3p{xl,zl;y) 

f: 

where (a) is because {Xp, z^) solves PEN, and (b) is because 
{x*, z*) satisfy the constraint ( fTSI l. Thus, the claim holds. ■ 
In the following we will focus on solving PEN, since 
Lemma [T] and Lemma |2] guarantee that PEN achieve an 
objective function value which is arbitrarily close to the 
optimal in OPT. We next describe the scheduling algorithm. 

B. Algorithm Description 

The problems OPT and PEN are static. On the other hand, 
the network is dynamic, and must be described by time series. 
Therefore, before describing the algorithm, we need to define 
dynamic counterparts of the static variables x,y and z. We 
first define empirical resource allocation variable 



Algorithm 1 Optimal ScheduUng 



x 



(m) 



it) 



, Vrn e M 



(19) 



i.e., each entry xj^'{t) corresponds to the time fraction that 
resource allocation mode is chosen during the first t time 
slots, when the network state is m. Note that we have 

x'^"'\t) h 0, l'^a;(™)(t) = 1, Vm e X (20) 

Thus, a;(™)(t) can be interpreted as the empirical value of 
ajf™) which is defined in PEN. Similarly, we denote the 
empirical value of the uncertain parameter y as 



y{t) - Y{t)/t 



(21) 



i.e., y{t) is formed by directly taking the average of the 
process Y{t). Further, define the empirical value of z as 



z{t) = Z{t)/t 

where the cumulative process Z{t) is defined by 

t 



(22) 



and u{t) is computed by the scheduler in Algorithm [T] 

Finally, we introduce some notations. Denote Vm and Vz 
as the gradient operator with respect to variables a;'™' and z, 
respectively. Further, with an abuse of notation, we use the 
following abbreviated notations: 

fit) ^ f{xit);yit)) 

Pit) 4 pixit),z{t);yit)) 

git) ^ gixit),zit);yit))^ fit)+Ppit) 

The algorithm is described as in Algorithm [T] Essentially, 
the algorithm updates the variables a;(<) and zit) by comput- 
ing descent directions v^^'^\t) and uit) in Step 1 and Step 2, 
respectively, where v'™' (i) is an all-zero vector except an one 



Step 1. At each time slot t with network state m, choose 
allocation mode where 

k e argmm (V„/(t) + /3V™p(i))^. 

Step 2. Choose variable such that 

e if iV^pit)), > 



(24) 



Uiit) 



else 



(25) 



and update variables a;(t), and zit) accordingly. 



at the fc-th entry. Further note that constraint >r Cj;,™' can 
be satisfied implicitly with regular cost and penalty functions, 
i.e., assuming the cost for transmitting a set of links is always 
no smaller than that of transmitting any of its subsets. 

From the definition of a;(t) and zit), these processes are 
naturally updated as follows 



x^'>it) = x'^'>it-l), l^m 
x( 



x^"'Ut) = a;(")(t-l)+ ^ } , — - a;(™)(t - 1)) 

^ ' ^ ' 1 T (t) \ // 

zit) = zit-l) + ^iuit)-zit-l)) 

Thus, Algorithm [T] can be viewed as a stochastic gradient 
algorithm for PEN, where the randomness comes from the 
time varying functions fit) and pit), which are subject to the 
changes in uncertain parameters yit). 

The optimization of (l24l i requires tracking the variables xit) 
and yit), in general. However, in applications the structure 
of the cost function fix; y) and penalty function pix; y) 
often allows a much simpler computation. For example, in 
the important case of optimal power scheduling, where the 
cost function is formulated as (fT3] l and the constraint is as 
(fTSI l with the typical value a = 2, we have 

V„/(t) + PVrrMt) = ^("^ WIP^") + ^(i?(™)G('"))^Q(i)) 



^^^^ where tt^™) (t) 



-.rrrr^in,^-. is the empirical time fraction 

of network state m. Thus, the optimization in (124b essentially 
only requires the queue length information (note that nmit) 
becomes an irrelevant scaling factor in the optimization). In 
particular, if we are only interested in the rate stability, i.e., 
setting the objective function as fix; y) — 0, the optimization 
in ([24b is equivalent to 



k e argmin {G^"''^'^ iRim)^ Qit))) 



(26) 



which is the same as the max-weight back-pressure algorithm 
proposed by We finally conclude this section by the fol- 
lowing lemma, which formally shows, essentially, the descent 
property of Algorithm [T] 

Lemma 3: The following properties hold for Algorithm [l] 



1) t)(™)(i) solves the following problem 

GRAD-X: min V,„g(t)^v(™) (27) 
s.t. ^ 0, = 1 

2) tt(i) solves the following problem 

GRAD-Z: min V^g{tfu (28) 

u 

s.t. el ^ M ^ ^maxl 

Thus, the variables {v,u) computed by Algorithm [T] can be 
interpreted as the points in the feasible region of PEN which 
achieves the minimum inner product with the corresponding 
(stochastic) gradients. 

Proof: For 1), note that GRAD-X is a Linear Pro- 
gramming (LP) problem over a simplex, and therefore the 
solution can be obtained at a vertex [iT^ with the minimum 
directional derivative. For 2), note that GRAD-Z is an LP 
over a hypercube, and therefore the solution is obtained at 
the boundary points. Thus the claim follows by noting that 
Vz5(t) = Vzp(i), since /(•) is not a function of z. ■ 

IV. Optimality Proof 

In this section we will prove the optimality of Algorithm 
[U There are two issues to consider: 1) We need to show that 
Algorithm [T] achieves the optimality of OPT asymptotically, 
and 2) We need to show that Algorithm [T] is feasible for 
OPT, i.e., constraint (fTTb can not be violated. We first briefly 
introduce fluid limits, which serves as the key technique for 
the optimality proof. 

A. Fluid Limits 

We extend the domain of all processes to continuous time by 
linear interpolation, and define the fluid scaling of a function 

l{t) as 

f{t) ^ l{rt)/r (29) 

where / can be functions T, Y and Z. It can be shown 
that these scaled functions are uniformly Lipschitz-continuous. 
Thus, according to the Arzela-Ascoli Theorem lfT3l . any 
sequence of functions which is indexed by {r„}^]^, i.e., 
(T''", l^'^", Z''^"), contains a subsequence {r„^}5^L]^ which 
converges uniformly on compact sets to a set of absolutely 
continuous functions (and, therefore, differentiable almost 
everywhere lil3J ) (T, Y , Z). Define any such limit as a fluid 
limit. (Note that fluid limits are denoted by a bar) We next 
state some properties of the fluid limits. 

Lemma 4: The processes in any fluid limit satisfies the 
following: For any t > 0, we have w.p.l, 

el < Z{t)/t < z,„axl (30) 

and the foHowing properties hold w.p.l: For all t > 

Y{t) = yt (31) 
l^T^"'\t) = 7r(™)t WmeM (32) 



Proof: (|30] l follows from Algorithm [T] where each Ui{t) 
is chosen between e and Zmax- (1311 1 and (l32T i follows directly 
from the (functional) SLLN. ■ 
We next define the resource allocation variables and auxil- 
iary variables in fluid limit as follows (one can compare with 
(fT9] l and (|22] | for similarities) 

= -^T'^"'\t) (33) 
z{t) = Z{t)/t (34) 

Similarly, define the following variables as the counter parts 
of and u{t) in Algorithm [T] 

^ r(")(i)/7r(™' (35) 
u{t) = 2{t) (36) 

We have the following lemma holds, which states that both 
{x{t),z{t)) and {v{t),u{t)) are feasible points for PEN. 
Lemma 5: For any fluid limit and t > 0, we have 

1) {x{t),z{t)) is feasible for PEN: 

el ^z{t)^ z„iaxl (37) 
i^s('")(i) = >r 0,Vto e X (38) 

2) {v{t),u{t)) is also feasible for PEN: 

el ^ u{t) ^ z„iaxl (39) 
(t) = 1, {t) >r 0, Vto e X (40) 

3) The derivatives of a;('™'(t) and z{t) are 

= {v^"'\t)~x^'^\t))/t (41) 
%{t) = {u{t)--z{t))/t (42) 

Proof: For 1), dJTl l follows from applying (|30] | to the 
definitions of z{t), and (|38] | follows from applying (|32] | to the 
definition of x''™-\t). Similarly we can prove 2), by noting 
that erl < Z{t + r) - Z{t) < ZmaxTl for any t > and 
T > 0. 3) follows from direct calculation. ■ 
We are now ready to prove the optimality of Algorithm [T] 

B. Optimality Proof 

For the ease of presentation, we use 

g(t)=9{x[t)rz{i);y{t)) (43) 

as a short-hand notation (note that they are the functions in 
fluid limits), with an abuse of notation. We next establish the 
following key technical lemma, which, essentially, extends the 
optimality property in Lemma [3] to the fluid limits. 

Lemma 6: Let a fluid limit (T, Y, Z) and m e M,t > Q 
be given. The following properties hold: 

1) solves the following problem 

GRAD-XBAR: min V „,g{tf v^'""^ (44) 
s.t. i3(™) ^ 0, l^i)('") = 1 



2) u{t) solves the following problem 

GRAD-ZBAR: min V^g{tfu (45) 

u 

S.t. el ^ M ^ Zmaxl 

Thus, the optimality in Lemma [3] still holds in fluid limits. 
We first outline the proof. For 1), since GRAD-XBAR an LP 
over a simplex, the optimum must correspond to the vertices 
with the smallest gradient. Thus, it is sufficient to prove that 
any resource allocation mode j will have w^^™-' {t) = if there 
is a fc such that 

(V„,g(t)), > (V„,g(t))fc (46) 

which follows from the optimality shown in Lemma [3] along 
a convergent subsequence. For 2), we will prove that for any 
feasible points u of ( l45T l. we have 



(47) 



which also follows from the optimality in Lemma [3] along a 
convergent subsequence. 

Proof: For the clarity of presentation, the proof is moved 
to the Appendix. ■ 

Based on the above lemma, we are now ready to prove that 
Algorithm [T] achieves the optimal cost in the fluid limit. 

Lemma 7: For any fluid limit, we have for all i > 0, 



g{x{t),z{t)-y{t))=g* 



(48) 



where g* ~ f{xp; y) + /3p{xp, z^; y). Thus, the optimality is 
achieved in the fluid limit. 

We first outline the proof. Note that it is always true that 



g{x{t),z{t)-y{t))>9* 



(49) 



since {x{t),z{t)) are always feasible points of PEN. Thus, 
the claim holds if we can prove the reverse direction. This 
can be done by defining a proper "Lyapunov" function 



L{t)=tg{x{t),z{t)-y{t)) 



(50) 



and show that L{t) < g* ,hy using the properties in Lemma 
|6]and the convexity of function g{-). 

Proof: Consider the "Lyapunov" function as in dSOl l in 
any fluid limit. From Lemma |5] we know that for any t > 
0, {x{t), z{t)) are feasible for PEN, and therefore we have 
L{t) > g*t due to the definition of g*. On the other hand, 

m = m+tm 

= m + tV^-gitfHt) + t V„,5(t)^s('")(i) 



(a) 



{b) 



g{t)+tV,g{tf{u{t)-z{t)) 

+ E V™gW^(t3('")(t)-sM(i)) 



< g{t)+tV,g{tf{z*p-Ht)) 

+ y: Vm-g{tfix^r^*-x("^\t)) 



< g(xl,ziyJg* 



where (a) is obtained by substituting the equation in Lemma 
|5] (6) is from Lemma |6] i.e., and u[i) are solutions 

of GRAD-XBAR and GRAD-ZBAR, respectively, (c) is due 
to the convexity of function g{-), and (d) is because (ajpjZp) 
is the solution of PEN, by definition. Thus, we have 

L{t) = L{0)+[ L{T)dT<g*t (51) 



from which we conclude that L{t) = g*t. ■ 
Having established the optimality in the fluid limit, we 

are now able to prove optimality in the original system. The 

following theorem states that Algorithm[T] achieves the optimal 

cost in the original network. 

Theorem 1: (optimal cost) In the original network, the 

following holds w.p.l: 



limsup/(a;(t);y(i)) </* 

t^OO 



(52) 



Proof: Suppose that it is not true. Then there is a sequence 
{t„}^^i such that 



lim f{x{t„);y{tn)) > f* 



(53) 



From the Arzela-Ascoli Theorem lfT3l . there is a subsequence 
{^ufclfc^i which converges to a fluid limit. Lemma |7] implies 



lim /(a;(t„J) 



< 



(a) 



lim gix{tnj) 

A;— >-oo 



(54) 



(55) 

g* < f: (56) 
where (a) follows from the fact that for all m G A4, 

a;(")(t„J = T(")(t„J/l^T(™)(<„J (57) 

= (r("))*"»=(l)/l^(T("))*"'=(l) (58) 

^ r^")(l)/lTj.M(i) as A; ^ oo (59) 

^ (60) 

and that g{-) is continuous. (6) is because of Lemma|2l Thus, 
we have a contradiction, and the claim holds. ■ 
In the next subsection we will continue to prove the fea- 
sibility result, namely, the limit points of x{t) produced by 
Algorithm [T] are indeed feasible for OPT. 

C. Feasibility Proof 

Note that Algorithm [T] is designed to solve PEN. Thus, 
in order to prove that the scheduler produce feasible points 
for OPT, we need the following lemma, which connects the 
objective function value in PEN to the constraint in OPT. 

Lemma 8: The following properties hold for PEN: For 
large enough /3, we have 

+ <e/2 (61) 

for any solution {x*,Zp). 

Proof: For the ease of presentation, we only consider 
the penalty function as ( fTSl l. although the proof can be easily 
extended to general cases. Note that from Lemma |2] we have 



h{x;;y)+z;r<f:-fix;;y) 



(62) 



Thus, ( |6TT i holds by choosing sufficiently large (3. ■ 
Finally, we conclude this section by the following theorem, 
which states that the limit points produced by Algorithm[T]are 
always feasible for the original problem OPT. This, combined 
with Theorem [l] proves the optimality of Algorithm [T] for 
OPT. 

Theorem 2: (feasibility) For sufficiently large /3, we have 



0.5 >■ 



VmvsnY> hi{x{t)\y{t)) < 



(63) 



for any constraint function hi{x; y) in h{x\ y). 

Proof: Suppose that this is not true. Then there exist a 
sequence {tnY^ 



such that 



lim hi{x{tn)\y{tn)) > 



(64) 



□ Transmitter 
O Receiver 



-0.1 




From Arzela-Ascoli Theorem, there is a subsequence 
{^"fclfcLi which converges to a fluid limit. Thus, we have 



Fig. 1. An example wireless network with 7 links, where square nodes are 
transmitters, and round nodes are receivers. 



\\h{x{iy,y{l)) + -z{l) 



> 

(a) 



> 

{c) 
> 



h,{x{iy,y{l)) 
lim ht{x{tnj 

lim hi{x{tnj 



where (a) can be argued similarly as in the proof of Theorem 
[T] (6) is because for any i and t > we have Zi{t) > e, 
due to Algorithm [l] and (c) is because of the assumption in 
(|64] |. But according to Lemma |7] {x{t)),z(t)) solves PEN, 
and therefore Lemma |8] implies that 



||M*(i);y(i)) + ^(i)ll <e/2 



(65) 



Contradiction! Therefore the claim holds. ■ 
Thus, Algorithm [U produces feasible points for OPT, and 
achieves a cost which is arbitrarily close to /*, by properly 
selecting parameters (3 and e. 

V. Simulation 

In this section we verify the performance of Algorithm [T| 
through a simulation in a random wireless network where the 
network is as shown in Fig [T] There are 7 links in the network, 
where square nodes denote the transmitters, and round nodes 
denote the receivers. We simulate a special case of OPT, 
the following minimum power scheduling problem, which we 
denote as POW: 





0.8 


t5 




GC 




"TO 


0.6 


> 




< 


0.4 




0.2 




10 




40 


O 


30 


o 




OJ 
Ol 


20 


nj 


Ave 


10 









10 




■ Empirical Average 1 

■ Empirical Average 2 

■ Expected 



10 



10' 10- 
Time Siots 



10 



10" 




- Average cost 1 

- Average cost 2 

- Optimai cost 



10 10 
Time Slots 



10 



POW: 



min p^x (66) 

X 

s.t. a-Gx diO (67) 

xtO, = 1 (68) 



where p is a power vector whose each element pk corresponds 
to the power consumption when the independent set Gk is 
chosen. In the simulation we choose pk ~ WGkW^- Here, a 
corresponds to the arrival rate vector, which is assumed to 
be the only unknown parameter in the network. Thus, (l67T i 
corresponds to the rate stability constraint. 



Fig. 2. Convergence results of average cost with sources with different 
convergence behaviors. 



Fig. |2] shows the convergence results of the cost function 
(the bottom sub-figure) with slowly converging sources (the 
top sub-figure) after a simulation of 10^ time slots. In the 
simulation, we choose e — 10~^ and /3 = 5 x lO'^. It 
can be observed from the top sub-figure that our algorithm 
achieves the optimal cost. Further, by comparing the conver- 
gence results of the cost and the arrival processes, we can 
conclude that Algorithm [T| can track the uncertain parameter 
a dynamically. In the simulation, it is further observed that 
the maximum queue length in the network is around 10^, so 
that the constraint in (|67] | is clearly satisfied. 

VI. Conclusion 

In this paper we formulated a general class of scheduling 
problems in wireless networks with uncertain parameters, 
subject to the constraint that these parameters can be obtained 
from the empirical average values of certain stochastic network 



processes. We proposed a class of primal-dual type scheduling 
algorithms, and showed its optimality as well as feasibility 
using fluid limits. 

Appendix 
Proof of Lemma[5] 

Proof: We first prove 1). Let a sequence of functions 
frjpr^ ^ ^ -j givcu, which converge to a fluid limit 
(T, Y, Z). In the fluid limit, suppose that there is time t > 0, 
m E A4 and resource allocation modes j, k such that 

(Vm5(0)j > (Vmff(i))fc + e (69) 

where e > is a small constant. Then, since Vmg{t) is a 
continuous function of variable t, there is (Si > such that for 

all T e {t — Si,t + 6i), we have 

(V™5(t))j > (V,„.9(T))fc + e/2 (70) 

Further, since V„i5(r) = V„i5(S(t), ^(t); ^(t))) is con- 
tinuous as a function of variables {x{T),y{T),z{T)) (and 
therefore is absolutely continuous when restricted to a compact 
local region), there is an e' > such that \\{x,y,z) — 
{x{T),y{T), z{t))\\ < e' implies that 

{V„,g{x,z;y))^ > (V™5(S(t), ^(t); ^(t)))^ + e/4 

for all T £ {t — di,t + Si). Now we define 

A T(™'(r„T)/l^r(")(r„T) (71) 
y'^"(r) ^ Yirnr)/rnT (72) 
z--{t) ^ Z{r,,T)/r,,T (73) 

Then, the definition of fluid limits implies that there exists N £ 
N and $2 > such that for all n > and r £ (t - (S2, i + (52), 

\\{x^"{T),y^'^{T),z^"{T)) ~ {x{T),y{T),z{Tm < e' (74) 

Thus, by taking S = min((5i, 62) we have 

(V„,9K"(r),^-(r);y-(r))^. 
> {V„M^^- (r) , z^- (r) ; y^" (r)) , + e/4 

for all T £ {t — 6,t + 6). Further, by comparing the above 
definitions of x'^"{t), y'^"'{T) and z'^"{t) to that of x{t), y{T) 
and z{t) in ( fT9] l, (ISTT i and (l22T i. respectively, we conclude that 
they are essentially the same, except a difference in time scale, 
i.e., a;''" [t) = a;(r„t). Thus, the following holds in the original 
system: for any n > N and all r £ (r„(t — (5), r„(t + (5)), 

(V™5(a;(T),2(T);y(T))^. > {\7^g{x{T), z{T);y{T)) ^ + e/A 

Therefore, according to Lemma [3] ^^^"'^ is never chosen in 
any time slot during (r„(t — r„(t + (5)), and we have that 
Tj"^\t) is a constant during (r„(f; — (5),r„(i -I- (5)), from 
which we conclude that T^"'\t) = 0. Therefore, v'f'\t) = 
following the definition that v'f'\t) = Tj'"^(i)/7r(™). 

We next prove 2). Let u. be given as a feasible point 
of GRAD-ZBAR and e > be given. Since V^gi-) is a 



continuous function of t, there is (5 > and N E N such 
that for n > N and all r < (i — (5, t + (S), the following holds: 

\\V.g{x^- {t),z^- (r); y^"(r)) - V..g(*W, y(t))|| < e 

(75) 

Further, note that Lemma |3] implies that for any time slot in 

[fnit — (5), rn{t + (5)), we have 

V,g{x{T),z{T)-y{T)fu{T) < V,g{x{T),z{T);y{T)fu 

Thus, applying ( iTSl l to the above inequality we have 

V,5(S(0,5(i);yWrw(r) < V,,g(a;(0,2(t);yW)^M + ce 

for all r £ (r„(t — (5),r„(t + (5)), where c > is a proper 
constant. After summing over (r„(t — (5),r„(t + 6)) and 
dividing by r„ on both sides, we obtain 

V,g{x{t),zit)- mfiZ"^" it + S)- (t - (5)) 
< 2S{V,g{x{t),z{t); y{t)fu + ce) 

Finally, we let n ^ 00, and noting that (5 > can be taken 
arbitrarily small, we have 

V,g{x{t),z{t); y{t)fu{t) < V,g{x{t),z{t); y{t)fu + ce 
from which 2) holds since e > is arbitrary. ■ 
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